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Abstract 


Maier et al. ( 2010l) introduced the relational 


causal model (RCM) for representing and in¬ 
ferring causal relationships in relational data. 
A lifted representation, called abstract ground 
graph (AGG), plays a central role in reasoning 
with and learning of RCM . The correc t ness of 
the algorithm proposed by iMaier et al.l (l2013ah 
for learning RCM from data relies on the sound¬ 
ness and completeness of AGG for relational d- 
separation to reduce the learning of an RCM 
to learning of an AGG. We revisit the defini- 
tio n of AGG a nd sho w that AGG, as defined 
in Maier et ^ ( 2013bl) . does not correctly ab¬ 


stract all ground graphs. We revise the defini¬ 
tion of AGG to ensure that it correctly abstracts 
all ground graphs. We further show that AGG 
representation is not complete for relational d- 
separation, that is, there can exist conditional in¬ 
dependence relations in an RCM that are not en¬ 
tailed by AGG. A careful examination of the re¬ 
lationship between the lack of completeness of 
AGG for relational t/-separation and faithfulness 
conditions suggests that weaker notions of com¬ 
pleteness, namely adjacency faithfulness and ori¬ 
entation faithfulness between an RCM and its 
AGG, can be used to learn an RCM from data. 


1 INTRODUCTION 

Discovery of causal relationships from observational and 
experimental data is a central problem with applications 
across multiple areas of scientific endeavor. There has been 
considerable progress over the past decades on algorithms 
for eliciting causal rela tionships from data under a broad 
range of assumptio ns ( Pearll 2000t Snirtes et al. . 2000l: 
Shimizu et al.L 2006h . Most algorithms for causal discovery 
assume propositional data where instances are independent 
and identically distributed. However, in many real world 


applications, these assumptions are violated because the 
underlying data has a relational structure of the sort that is 


mode led in practice by an entity-relationship model (IChen . 


1976h . There has been considerable w ork on learning pre 


dictive models from relational data (IGetoor and Taskai 


2007h . Furthermore, researchers from different disciplines 


have studied causal relationships and resultin g phenom¬ 
ena on relational world, e.g., peer e ffects ( Sacerdot^ 


200C Qgburn and VanderWeelel 2014I). social contagio n 


viral marketing (Les 

kovec et al.l 2007 

), and information 

diffusion ( 

Gruhl et al. 

, 2 ()() 4 - 


Motivated by the limitations of traditional approaches to 
learning causal relationships from relational data, Maier 
and his colleagues introdu ced the relational causal model 
(RCM) ( Maier et ^ 201f)f) and provided a sound and com¬ 
plete causal structure learning algorithm, c alled the re 


lationa l causal discovery (RCD) algorithm (IMaier et al. 


2013ah . for inferring causal relationships from relational 
data. The key idea behind RCM is that a cause and its ef¬ 
fects are in a direct or indirect relationship that is reflected 
in the relational data. Traditional approaches for reason¬ 
ing on and learning of a causal mod el cannot be triviall y 
applied for relational causal model ( Maier et'Hl 2013a ). 
Reasoning on an RCM to infer a relational version of con¬ 
ditional independence (Cl) makes use of a lifted representa¬ 
tion, called abstract ground graphs (AGGs), in which tradi¬ 
tional graphical criteria can be used to answer relational Cl 
queries. The lifted representation is employed as an internal 
learning structure in RCD to reflect the inferred Cl results 
among relational version of variables. RCD makes use of a 
new orientation rule designed specifically for RCM. 


Motivation and Contributions RCM (IMaier et al. 


l201(ll) offer an attractive model for representing, rea 
soning about, and learning cau s al rel ationships implicit 
in relational data. Arbour et al. ( 20141) proposed a rela¬ 
tional version of propensity score matching method to 
infer (relational) ca u sal ef fects from observational data. 
Marazopoulou et al. ( 2015h extended RCM to cope with 


temporal relational data. They generalized both RCM 






























































































and RCD to Temporal RCM and Temporal RCD, re¬ 
spectively. A lifted representation, called abstract ground 
graph (AGG), plays a central role in reasoning with 
and learning of RCM. T h e corre ctness of the algorithms 
proposed by Maier et al. _(|2013ah for learning RCM and 

J 


Marazopoulou et al.l (120 15h for Temporal RCM, respec¬ 


tively, from observational data rely on the soundness and 
completeness of AGG for relational d-separation to reduce 
the learning of an RCM to learning of an AGG. The main 
contributions of thi s paper are as follow s: (i) We show that 
AGG, as defined in iMaier et al.l (l2013bh does not correctly 
abstract all ground graphs; (ii) We revise the definition of 
AGG to ensure that it correctly abstracts all ground graphs; 
(iii) We further show that AGG representation is not 
complete for relational (f-separation, that is, there can exist 
conditional independence relations in an RCM that are not 
entailed by AGG; and (iv) Based on a careful examination 
of the relationship between the lack of completeness of 
AGG for relational cZ-separation and faithfulness con¬ 
ditions suggests that weaker notions of completeness, 
namely adjacency faithfulness and orientation faithfulness 
between an RCM and its AGG, can be used to learn an 
RCM from data. 


[Prod, Dev, Emp] .competence —f [Prod] .success 



Figure 1: A toy example of RCM adopted from iMaiei 
( 120141) with two relational dependencies: (i) the success of 
a product depends on the competence of employees who 
develop it; (ii) employee’s salary is affected by his/her com¬ 
petence. 


class Ej in a relationship class Ri is denoted by Ej £ Ri 
= E,. 

Definition 2. A relational skeleton a is an instantiation of 
relational schema S, represented by a graph of entities and 
relationships. Let a (I) denote a set of items of item class 
/ £ I in cr. Let ij, ik & cr such that ij £ cr(/j), ik £ cr{Ik), 
and Ij ,Ik £ T, then we denote ij ^ ik if there exists an 
edge between ij and ik in a. 


2 PRELIMINARIES 


We follow notational conventions introduced in 


(iMaier et al 


.[2013aB 

^ Il976l) a 


Maieiil2014l) . An entity-relationship 


model dChenl 119761) abstracts the entities (e.g., employee. 


product) and relationships (e.g., develops) between entities 
in a domain using a relational schema. The instantiation 
of the schema is called a skeleton where entities form a 
network of relationships (e.g., Quinn-develops-Laptop, 
Roger-develops-Laptop). Entities and relationships have 
attributes (e.g., salary of employees, success of products). 
Cardinality constraints specify the cardinality of relation¬ 
ships that an entity can participate in (e.g., many employees 
can devel op a product. )!! The following definitions are 
taken from iMaiei ( 20141) : 


Definition 1. A relational schema 5 is a tuple 
{£, TZ, A, card): a set of entity classes S', a set of relation¬ 
ship classes TZ where Ri = (Ej)^^^ and n = |i?i| is arity 
for Ri', attribute classes A where A (/) is a set of attribute 
classes of J £ £ U TZ', and cardinalities card : TZ x £ ^ 
{one, many}. 


2.1 RELATIONAL CAUSAL MODEL 


Relational causal model (RCM, Maier et ah . 2010h is a 
causal model where causes and their effects are related 
given an underlying relational schema. For example, the 
success of a product depends on the competence of employ¬ 
ees who develop the product (see Figure[TJ. An RCM mod¬ 
els relational dependencies', each relational dependency 
has a cause and its effect, which are represented by rela¬ 
tional variables', a relational variable is a pair consisting of 
a relational path and an attribute. 


Definition 3. A relational path P = [Ij,..., R] is an al¬ 
ternating sequence of entity class E € £ and relationship 
class R & TZ. An item class Ij is called base class or per¬ 
spective and Ik is called a terminal class. A relational path 
should satisfy: 


1. for every [E, R] or [i?, E], E £ R', 

2. for every [E, R, E'], E E'-, and 

3. for every [R,E,R'], if i? = R', then card {R,E) = 
many. 


Every relationship class Ri have two or more distinct en¬ 
tity classes!! We denote by X all item classes £ \JTZ. We 
denote by Ix an item class that has an attribute class X 
assuming, without loss of generality, that the attributes of 
different item classes are disjoint. Participation of an entity 

*The examples are taken from lMaie 

^In general, the same entity class can participate in a relation¬ 
ship class in two or more different roles. For simplicity, we only 
consider relationship classes only with distinct entity classes. 


All valid relational paths on the given schema S are de¬ 
noted by P 5 . We denote the length of P by |P|, a subpath 
by P"^' = [PkYk=^ or P" = [Pfe]£'i for 1 < z < j < |P|, 
and the reversed path by P = [P|p|,..., P2, Pi] . Note that 
all subpaths of a relational path as well as the correspond¬ 
ing reverse paths are valid. A relational variable P.X is 
a pair of a relational path P and an attribute class X for 
the terminal class of P. A relational variable is said to be 




















































canonical if its relational path has a length equal to 1. A re¬ 
lational dependency is of the form ,..., 1^] .Y 
such that its cause and effect share the same base class and 
its effect is canonical. 

Given a relational schema S, a relational (causal) model 
Aie is a pair of a structure Ai = (S, D), where D is the 
set of relational dependencies, and 0 is a set of parame¬ 
ters. We assume acyclicity of the model so that the attribute 
classes can be partially ordered based on D. The parame¬ 
ters 0 define conditional distributions,p([/].X|Pa([/].X)), 
for each pair (/, AT) where / € I, X G A{I), and 
Pa([/].Ar) is a set of causes of [I].X, i.e., {P.Y\P.Y 
\I].X G D}. This paper focuses on the structure of RCM. 
Hence we often omit parameters 0 from M. 


Terminal Set and Ground Graph Because a skeleton is 
an instantiation of an underlying schema, a ground graph 
is an instantiation of the underlying RCM given a skeleton 
translating relational dependencies to every entity and re¬ 
lationship in the skeleton. It is obtained by interpreting the 
dependencies defined by the RCM on the skeleton using 
the terminal sets of each of the instances in the skeleton. 

Given a relational skeleton a, the terminal set of a rela¬ 
tional path P given a base b G cr(Pi), denoted by P\i,, is 
the set of terminal items reachable from b when we tra¬ 
verse the skeleton along P. Formally, a terminal set Pj;, is 
defined recursively, P^''^\b = {&} and 


P^-\ = {zG aiPe) I jGP^--^-\, 


This implies that P^'^\b and P\b will be disjoint for 1 < 
i < |P|. Restricting the traversals so as not to revisit any 
previously visited items corre sponds to the bridge burning 
semantics (hereinafter, BBS) ( Maier et'Hl 2013bl) . The in¬ 
stantiation of an RCM Ai for a skeleton a yields a ground 
graph which we denote by GGm(t- The vertices of GGm <7 
are labeled by pairs of items and its attribute, {i.X \ I G 
X, iGa{I), 2f eyf(/)}. There exists an edge ij.X ^i^.Y 
in GGm <7 such that ij G ik G a{Ik), Y G A{Ik), 

and X G A{Ij) if and only if there exists a dependency 
P.X -^[Ik].Y G D such that ikGP\ij. 


In essence, RCM models dependencies on relational do¬ 
main as follows: Causal relationships are described from 
the perspective of each item class; and are interpreted for 
each items to determine its causes in a skeleton yielding a 
ground graph. Since an RCM is defined on a given schema, 
RCM is interpreted on a skeleton so that every ground 
graph is an instantiation of the RCM. 


Throughout this paper, unless specified otherwise, we as¬ 
sume a relational schema S, a set of relational dependen¬ 
cies D, and an RCM Ai = {S, D). 


3 REASONING WITH AN RCM 


An RCM can be seen as a meta causal model or a tem¬ 
plate whose instantiation, a ground graph, corresponds to a 
traditional causal model (e.g., a causal Bayesian network). 
Reasoning with causal models relies on conditional inde¬ 
pendence (Cl) relations a mong variab les. Graphical crite¬ 
ria such as £/-separation (Pear], 2000h are often exploited 
to test Cl given a model. Hence, the traditional definitions 
and methods for reasoning with causal models need to be 
“lifted” to the relational setting in order to be applicable to 
relational causal models. 


Definition 4 (Relational (f-separation (iMaieii 120141) 1. Let 
U, V, and W be three disjoint sets of relational variables 
with the same perspective B G X defined over relational 
schema S. Then, for relational model structure Ai, U and 
V are t/-separated by W if and only if, for all skeletons 
cr G Ys, Ulh and V|f, are t/-separated by Wlb in ground 
graph GGmu for all 6 G cr (B). 


There are two things implicit in this definition: (i) all¬ 
ground-graphs semantics which implies that c/-separation 
must be hold over all instantiations of the model; (ii) the 
terminal set items of two different relational variables may 
overlap (which we refer to as intersectability). In other 
words, two relational variables U = P.X and V = P' .X 
of the same perspective B and the same attribute, are said 
to be intersectable if and only if: 




^b£a(B)P\b n P'\b ^ 


( 1 ) 


In order to allo w testing of conditio nal independence on all 
ground graphs. iMaier et alJ (l2013ah introduced an abstract 
ground graph (AGG), which abstracts all ground graphs 
and is able to cope with the intersectability of relational 
variables. We first recapitulate the original definition of 
AGGs. 


3.1 ORIGINAL ABSTRACT GROUND GRAPHS 


An abstract ground graph AGG mb is define d for a given 


relatio nal model Ai and a perspective B G X (iMaier et al. 


2013ah . Since we fix the model, we omit the subscript Ai 


and denote the abstract ground graph for perspective B by 
AGGb- The resulting graph consists of two types of ver¬ 
tices: RV B and IV^; and two types of edges: RVE^ and 

IVEb. 


We denote by RVb the set of all relational variables (RV) 
whose paths originate in B. We denote by RVEb the 
set of all edges between the relational variables in RV b- 
A relational variable edge (RVE) implies direct influence 
arising from one or more dependencies in D. There is an 
RVE P.X Q.Y if there exists a dependency R.X 
[/y].V G D that can be interpreted as a direct influence 
from P.X to Q.Y from perspective B. Such an interpreta¬ 
tion is implemented by an extend function, which takes two 
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Figure 2: A schematic example of how extend is com¬ 
puted showing two relational paths in P n Q where 
card(Ph,iJ 3 ) = many. If card(P;,)is one, then P Xi 
Q is not valid due to rule 3 of relational path. A path 
P N 2 Q is invalid due to the violation of rule 2, i.e., 
[. .., P2, Rb, E2, .. .]. 


relational paths and produces a set of relational paths: If 
P e extend((5, R), then there exists an RVE P.X Q.Y 
where 


extend((5, R) = * + R^'\i G pivots((5, P)}nP5, 

( 2 ) 

pivots(5', T) = and ‘+’ is a concatena¬ 

tion operator. We will use a binary join operator ‘x’ for 
extend and denote + P*’ by Q x^ P for a pivot i. 

A schematic overview of extend is shown in Figure|2] 


We denote by IVb the set of intersection variables (IVs), 
i.e., unordered pairs of intersectable relational variables in 
RVb. Given two RVs P.X and P'.X that are intersectable 
with each other, we denote the resulting intersection vari¬ 
able by P.X n P'.X (Here, the intersection symbol ‘D’ 
denotes inters ectability of t he two relational variables). By 
the definition ( Maier et al. . 2013bh . if there exists an RVE 
P.X Q.Y, then there exist edges P.X HP'.X Q.Y 
and P.X Q.YC\Q'.Y for every P' and Q' intersectable 
with P and Q, respectively. The IVs and the edges that 
connect them with RVs (IVEs) correspond to indirect in¬ 
fluences (arising from intersectability) as opposed to direct 
influence due to dependencies (which are covered by RVs 
and RVEs). We denote by IVEg the set of all such edges 
that connect RVs with IVs. 



Eigure 3: An AGG example excerpted from iMai^ ( 2014ll 
with business unit (Biz) which/wniis (Fund) its products 
from its revenue (rev). The revenue of business units that 
fund the products developed by an employee (IV) is af¬ 
fected by the employee’s co-workers’ competence {X), i.e., 
W jL X. Two are conditionally independent by block¬ 
ing both V and Y. Since IV V is in C7 and Z, both 
W _1L X\{V,U} and W _1L X\{V, Z} hold, which are 
equivalent to {WYX\{V,U})j^ and (W YX\{V, Z})j^, 
respectively. 


tional (i-separation on an AGG. 

We later show that the preceding definition of AGG does 
not properly abstract all ground graphs; nor does it guar¬ 
antee the correctness of reasoning about relational d- 
separation in an RCM. We revise the definition of AGG 
(Section 13.21) so as to ensure that the resulting AGG ab¬ 
stracts all ground graphs. However, we find that even with 
the revised definition of AGG, the AGG representation is 
not complete for relational cZ-separation, that is, there can 
exist conditional independence relations in an RCM that 
are not entailed by AGG (Section 14.1b . A careful exami¬ 
nation of the lack of completeness of AGG for relational 
£/-separation with respect to causal faithfulness yields use¬ 
ful insights that allow us to make use of weaker notions of 
faithfulness to learn RCM from data (Section l4~2l i. 

3.2 ABSTRACT GROUND GRAPHS - A REVISED 
DEFINITION 


Two AGGs with different perspectives share no vertices 
nor edges. Hence, we view all AGGs, {AGGb^b^x, as 
a collection or a single multi-component graph AGG = 
We similarly define RV, IV, RVE, and 
IVE as the unions of their perspective-based counterparts. 

Eor any mutually disjoint sets of relational variables U, V, 
and ’W, one can test U _L V | W, conditional inde¬ 
pendence admitted by the underlying probability distribu¬ 
tion, by checking tj _1L V | W (traditional) tZ-separation 
on an aggH where V includes V and their related IVs, 
V = VU{VnT G IV I V e V}. Eigure [3] illustrates rela- 

^We denote conditional independence by ‘±’ in general. We 
use ‘X’ to represent (traditional) tZ-separation on a directed 
acyclic graph., e.g., AGGai or GGmit. Furthermore, we paren¬ 
thesize conditional independence and use a subscript to specify 
the scope of the conditional independence, if necessary. 


Because of the importance of IV and IVE in AGG in 
reasoning about relational rZ-separation, it is possible that 
errors in abstracting all ground graphs could lead to errors 
in Cl relations inferred from an AGG. We proceed t o show 
that 1 ) the criteria for determining intersectability (IMaiei . 
20141) are not sufficient, and 2) the definition of IVE, as 
it stands, does not guarantee the soundness of AGG as an 
abstract representation of the all ground graphs of an RCM. 
We provide the necessary and sufficient criteria for deter¬ 
mining IVs and a sound definition for IVEs. 


3.2.1 Intersectability and IV 

The declarative characterization of intersectability (Eq. [T]) 
does not offer practical procedural c riteria to dete rmine in¬ 
tersectability. Based on the criteria ( Maieii 2014 1. two dif- 
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Figure 4; Comparison of 1) the necessary condition of the 
existence of an RVE P ^ Q through R, the cause path of a 
dependency (attributes are omitted), 2 ) intersectability be¬ 
tween P and P', and 3) co-intersectability of (Q, R, P, P'). 


b 
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Figure 5: A schematic illustration of Example [T| superim¬ 
posing a skeleton and relational paths. The items for P' 
starting with b should follow a dashed red line, and, hence, 
P' cannot be related to a ground graph edge between ik and 
ij, i.e., an RVE between P and Q (attributes and connec¬ 
tions between entities and relationships are omitted). 


ferent relational paths P and Q are intersectable if and only 
if 1) they share the same perspective, say B € 2, and 2) 
they share the common terminal class, and 3) one path is 
not a prefix of the other. We will prove that the preceding 
criteria are not sufficient. In essence, we will show the con¬ 
ditions under which non-emptiness of P\b fl Q\b for any 
b G <j{B) in any skeleton a always contradicts the BBS. 
Eor the proof, we define LLRSP(P, Q) (the length of the 
longest required shared path) for two relational paths P 
and Q of the common perspective as 

max{£ I P^--^ = = !}• 

LLRSP(P, Q) is computed as follows. Initially set £ = 1 
since Pi=Qi. Repeat incrementing^ by 1 if Pi+i = Qe+i 
and either Pi G TZ or Pi G £ with card(P^, Pi+i) = one. 
Lemma 1. Given a relational schema S, let P and Q 
be two diff erent re l ationa l paths satisfying the (necessary) 
criteria of \Maiei\ ( l2Q74l) and \Q\ < |P|. Let m and n 
be LLRSP(P, Q) and LLRSP(P, Q), respectively. Then, P 
and Q are intersectable if and only if m -f n < \Q\. 


Proof. See Appendix. 


□ 


The lemma demonstrates the criteria by Maieil ( 2014 ) do 
not rule out the case of m -I- n > \Q\ where P and Q cannot 
be intersectable. 


3.2.2 Co-intersectability and IVE 


Based on the definition (iMaieii 120141) . an IVE exists be¬ 
tween an IV, UOV, and an RV, W, if and only if there 
exists an RVE between U and W or V and W. It would 
indeed be appealing to define IV, UnV, such that it inherits 
properties of the corresponding RVs, U and V. However, 
the abstract ground graph resulting from such a definition 
turns out to be not a sound representation of the underlying 
ground graphs. We proceed to prove this result. 


Definition 5 (Co-intersectability). Given a relational 
schema S, let Q, R, P, and P' be valid relational paths 
of the same perspective B where P G Q m R and P and P' 
are intersectable. Then, a tuple {Q, R, P, P') is said to be 
co-intersectable if and only if 


R\i. n P\b n P'\b f 0. (3) 


We relate co-intersectability with the definition of IVE. Let 
an RVE P.X —)> QY is due to some dependencies R.X —> 
[Jy].F G D where P GQviR. This implies 

^aGSs^bea{B)^ijGQ\b R\ij 13 P\b f 0, (4) 

and there are edges from X of nP|t, to Y of Q\b in 
GGcr. In order for the intersectability of P' with P trans¬ 
lates into an influence between P and Q, it is necessary that 
there exists a skeleton that admits such influence. However, 
we can construct a counterexample that satisfies the neces¬ 
sary conditions for the existence of an RVE and the con¬ 
ditions for intersectability but does not satisfy the condi¬ 
tions for co-intersectability (see Eigure|4]for a comparison 
of Eq.|4l[T] and[3). 

Example 1. Let 5 be a relational schema where E = 
{/j-, 4 ,P,Pi,P 2 ,p 3 }. R = {Ri}l=i such that Pi = 
{B,Ei), P 2 = P 3 = {Ei,E 2 ), Ra = 

{E 2 , E 3 , Ik), P 5 = {Ik,Ij) with the cardinality of each 
relationship and each entity in the relationship being one. 
Let 

• Q = [B, Pi, Pi, P 2 , P 3 , P 4 ,4, Rbjij], 

• R = [ 4 , P 5 ,4, Ra, E 3 , P 2 , Pi, P 3 , P 2 , P 4 ,4], 

• P = [P,Pi,Pi,P 3 ,P 2 ,P 4 , 4 ], and 

• P'= [P, Pi, Pi, P 2 , P 3 , P 4 ,4]- 

Observe that 

1. P G extend (Q, P); 

2. P' and P are intersectable; and 

3. P' is a subpath of Q. 

This example satisfies Eq.[T]and Eq.|4] Assume for contra¬ 
diction that there exists a skeleton cr satisfying Eq.[3] Since, 
in this example, the cardinality of each relationship and 
each entity in the relationship is one, for each b G a (B), 
there exists only one ij G Q\b and only one 4 G Pj;,. By 
the assumption, P'\b = {4}- Since P' is a subpath of Q, 
P'\b will end at fy = R^'^Uj Eigure|5]). Due to BBS, 
P|i^. nP^-^li^. = 0, that is, {4}3{4} = 0. This contradicts 
the assumption that 4 = *4 























This counterexample clearly represents there is an inter¬ 
dependency between intersection variables and RVEs. 
Therefore, we revise the definition of IVE accompanying 
co-intersectability. 

Definition 6 (IVE). There exists an IVE edge, P.X fl 
P'.X^Q.Y {ox PX^Q.Yf^Q'.Y), if and only if there 
exists a relational path R such that R.X —> G D, 

P G Q N i?, and (Q, P, P, P') (or (P, R, Q, Q')) is co- 
intersectable. 

To determine IVEs, co-intersectability of a tuple can be 
computed by solving a constraint satisfaction problem in¬ 
volving four paths in the tuple. 


Implications of Co-intersectability We investigated the 
necessary and sufficient criteria for intersectability and re¬ 
vised the definition of IVE so as to guarantee that AGG 
correctly abstracts all ground graphs as asse rted (although 
incorrectly) by Theorem4.5.2 ( Maieil 2014 ). The new cri¬ 
terion, called co-intersectability, is especially interesting 
since it describes the interdependency between intersec¬ 
tion variables and related relational variable edges. Sev¬ 
eral of the key results (e.g., soundness and completeness of 
AGG for relational ^f-separation, Theorem 4.5.2 ) and con¬ 
cepts (e.g., (P,/i)-reachability) of Maier ( 20141) are based 
on independence between intersection variables and related 
relational variable edges. Hence, it is useful to carefully 
scrutinize the relationship between AGG and relational d- 
separation. 


4 NON-COMPLETENESS OF AGG FOR 
RELATIONAL D-SEPARATION 



Eigure 6 ; Co-intersectability of {Q, D 2 , S, S') where iy G 
Q\b, iz G D 2 \iy, iz G S\b, and G S'\b. The thick line 
highlights items for S' from b to iz. The red dashed line 
represents the instantiation of an RVE S.Z Q.Y as 
iz-Z iy.Y in a ground graph (attributes are omitted). 


(U^V| W)agg;m A 

{U\b±V\b\W\b)aG^^- 

4.1 A COUNTEREXAMPLE 

The following counterexample shows that AGG is not com¬ 
plete for relational rZ-separation. 

Example. Let S = (f,P, A, card) be a relational 

schema such that; S = R = with 

Ri = {Ei,E2,E4), R2 = {E2,E^), and P 3 = 

{E^,E4,Eb)\ A = {E 2 -.{Y},E:,-.{X},Eb-.{Z]}- 
and V_ReKV_EgflCard (P, P) = one. Let Ad = (5, D) be 
a relational model with 


D = {Di.X ^ [Iy].Y, D 2.Z ^ [Iy].Y} 


We first revisit the definition of relational d-separation. 
Given three disjoint sets of relational variables U, V, and 
W of a common perspective P G I, U and V are rela¬ 
tional d-separated given W, denoted by (U_LV | W)^, 
if and only if 


such that Di — [E 2 , P 2 , P 3 , P 3 , P 4 j Ri j E 2 j R 2 j E 3 ] ^nd 
D 2 = [E 2 ,R 2 ,E 3 ,R 3 ,E 5 ].LetP.X,Q.Y, S.Z, and S'.Z 
be four relational variables of the same perspective B = Ei 
where their relational paths are distinct where 


(U|, X V|, I W|(.)gg;w. • 

Erom Theorem 4.5.4 of dMaier . 20141) . the lifted repre¬ 
sentation AGGai is said to be sound (or complete) for 
relational d-separation of Ad if (traditional) d-separation 
holds on the AGGm with a modified Cl query only when 
(or whenever) relational d-separation holds true. Then, the 
completeness of AGG for relational d-separation can be 
represented as 


• P = [Ei,Ri,E2,R2tE^\, 

• Q =[Ei^RitE4,Rz,E^,R2,E2\, 

• S'= [Pi, Pi, P4, P3, P5], and 

• S' = [El , Pi, P 2 , P 2 , P 3 , P 3 , P 5 ]. 

Given the above example, we can make two claims. 
Claim 1. (P^ JL STP | ^)agg^' 


(U±V|W)„^(UiV|W)^„„^. 


Proof. See Appendix. 


□ 


The completeness can be proved by the construction 
of a skeleton a G E 5 demonstrating d-connection 
(Ujb fL Vjfc I for some b € a {B) if (U JL 

V I VV)agGai- fri other words, we might disprove the 
completeness by showing 


Assuming that AGG is complete for relational d- 
separation, we can infer (P.X / S'.Z | Q.Y)j^ and there 
must exist a pair of a skeleton a and a base b € a (B) 
that satisfies {P.X\b JL S'.Z\b \ Q-YMggj^ ■ However, 
we claim that such a skeleton and base may not exist. 



























Figure 7; A subgraph of ground graphs to represent > 
iy *z- Only this substructure satisfies BBS assumption 
and cardinality constraints. 


Claim 2. There is no cr S S 5 and 6 S cr (B) such that 

{P.X\y ^ S'.Z\t I Q.Y\b)GGj^^ ■ 

Proof. See Appendix. □ 

The counterexample demonstrates that a (f-connection path 
captured in an AGGa 4 might not have a corresponding 
(f-connection path in any ground graph. 

Corollary 1. The revised (as well as the original) abstract 
ground graph for an RCM is not complete for relational 
d-separation. 

It is possible that an additional test can be utilized to check 
whether there exists such a ground graph that can repre¬ 
sent a t/-connection path captured in AGGy^. However, 
the efficiency of such an additional test is unknown and de¬ 
signing such a test is beyond the scope of this paper. 


admitted by A4q and its lifted representation AGG^k are 
not necessarily equivalent (see Corollary [T]i. 


We will relate Af and AGG^r using an analogy of causal 
Markov condition a nd fa ithfulness ( Spirtes et al. . 2000t 
Ramsey and Spirit 2006h interpreting AGGx and M 
as a DAG G and a distribution p, respectively. We first re¬ 
capitulate the definitions for causal Markov condition and 
faithfulness. 

Definition 7 (Caus al Markov Condition 
( Ramsey and Spirtes . 20061) '). Given a set of variables 
whose causal structure can be represented by a DAG 
G, every variable is probabilistically independent of its 
non-effects (non-descendants in G) conditional on its 
direct causes (parents in G). 


The causal Markov condition (i.e., local Markov condi¬ 
tion) is not directly translated into the relationship be¬ 
tween AGGtk and fA since they refer to different vari¬ 
ables. However, the soundness of AGG^r for relational 
(7-separation of JA (i.e., global Markov condition) would 
be sufficient to interpret causal Markov condition between 
AGGa^ and lA. That is. 


V(7.y,iyeRV (£/ JL V- I ^ ([/ X n W)^ 


where U, V, and W are distinct relational variables sharing 
a common perspective. 


Definition 8 (Causal Faithfulness Condition 
( Ramsey and Spirtes . 20061) ). Given a set of variables 
whose causal structure can be represented by a DAG, no 
conditional independence holds unless entailed by the 
causal Markov condition. 


4.2 RELATING NON-COMPLETENESS WITH 
FAITHFULNESS 

In light of the preceding result that AGG is not complete 
for relational li-separation, we proceed to examine the re¬ 
lationship between an RCM and its lifted representation in 
terms of the sets of conditional independence relationships 
that they admit. In RCM, there are several levels of rela¬ 
tionship regarding the sets of conditional independence; 
between the underlying probability distributions and the 
ground graphs, between the ground graphs of an RCM and 
the RCM, and between the RCM and its AGG: 


{p o GG^Qo-jo-eEs Ade o AGGjvi 

In RCM, the causal Markov condition and causal faith¬ 
fulness condition (see below) can be applied between a 
ground graph GGmct and its underlying probability distri¬ 
bution p. Both conditions are assumed for learning an RCM 
from relational data. Relational (/-separation requires a set 
of conditional independence of Ade using those deduced 
from every ground graph GGmso for every a S S 5 . In 
light of the lack of completeness of AGG for relational 
(/-separation, the set of conditional independence relations 


By the counterexample above. Ad is not strictly faithful to 
AGG^, because more conditional independences hold in 
Ad than those entailed by AGG^r. 


4.2.1 Weaker Faithfulness Conditions 


Ramsey and Spirit ( 2006h showed that the two weaker 
types of faithfulness - adjacency-faithfulness and 
orientation-faithfulness - are sufficient to retrieve a 
maximally-oriented causal structure from a data under the 
causal Markov condition. What we have showed is that 
there are more conditional independence hold in Ad than 
those entailed by its corresponding AGG^r- However, 
the two weaker faithfulness conditions hold true (if they 
are appropriately interpreted in an RCM and its lifted 
representation). 


Adjacency-Faithfulness 

Definition _ 9 _ (Adjacency-Faithfulness 

( Ramsey and Spirtes . 20061) ). Given a set of variables 
V whose causal structure can be represented by a DAG 
G, if two variables X, Y are adjacent in G, then they are 
dependent conditional on any subset of V \ {X, Y}. 

















































Let U, V be two distinct relational variables of the same 
perspective B. We limit U and V to be non-intersectable 
to each other. Otherwise, they must not be adjacent to each 
other by the definition of RCM since an edge between inter- 
sectable relational variables yields a feedback in a ground 
graph. If there is an edge U ^ V in AGG^, 


VwcRVB\{(7,y} (U -/-V \ W)^ 


We can construct a skeleton a € S 5 where its correspond¬ 
ing ground graph GGm<t satisfies that U\b and L|{, are sin¬ 
gletons and U\b U V\b are disjoi nt to (RVb \ If/, Lj) |{, 
for b € <J {B). Lemma 4.4.1 by Maiej (l2014h describes 
a method to construct a minimal skeleton to represent U 
and V with a single b G a{B).lt guarantees that U\b 
and Llb are singletons and every relational variable W G 
RVb\{L,L} satisfies kk lb nC /|6 = 0 and IL|bnL|b = 0. 


Orientation-F aithfulness 

Definition _ ^10 _ (Orientation-Faithfulness 

dRamsev and Spirtesi 120061) ). Given a set of variables 
V whose causal structure can be represented by a DAG G, 
let {X, y, Z) be any unshielded triple in G. 


(Ol) if A Y ■(— Z, then X and Z are dependent given 
any subset of V \ {X, Z} that contains Y ; 


(02) otherwise, X and Z are dependent conditional on any 
subset of V \ {X, Z} that does not contain Y. 


Let U, V, and W be three distinct relational variables of 
the same perspective B forming an unshielded triple in 
AGGa^. Similarly, V is not intersectable to both U and 
W. The condition (Ol) can be written as 

^TC-RV b\{u,w}{U -)l-W I T U {V})m 

if edges are oriented as ^ L ^ kk in AGGx- Other¬ 
wise, 


VTCRVi3\{(7W}(^I T \ {^})m 

for the condition (02). Again, constructing a minimal 
skeleton for U, V, and kk guarantees that no T G RV b \ 
{U,V,W} can represent any item in {U\b,V\b,W\b}. 
Thus, the existence of V in the conditional determines 
(in)dependence in the ground graph induced from the min¬ 
imal skeleton. 


Learning RCM with N on-complete AGG RCD (Rela¬ 
tional Causal Discovery. iMaier et al.l ( 2013ah ) is an algo¬ 
rithm for learning the structure of an RCM from relational 
data. In learning RCM, AGG plays a key role; AGG is con¬ 
structed using Cl tests to obtain the relational dependencies 
of an RCM. The lack of completeness of AGG for rela¬ 
tional t/-separation in RCM raises questions about the cor¬ 
rectness of RCD. A careful examination of AGG through 


the lens of faithfulness suggests that adjacency-faithful and 
orientation-faithful conditions can be applied to AGGa 4 
to recover correct partially-oriented dependencies for an 
RCM. However, it is still unclear whether RCD recov¬ 
ers maximally-oriented dependencies with the acyclicity of 
AGG (i.e., relational variables) not the acyclicity of RCM 
(i.e., attribute classes). This raises the possibility of an al¬ 
gorithm for learning the structure of an RCM from rela¬ 
tional data that does not require the intermediate step of 
constructing a lifted representation. 


5 CONCLUDING REMARKS 


There is a growin g intere st in rela t ional causal models 
(Maieretak, 20 id, 2013^ : Maiei . 2014 : Arbour et al. 


2014t iMarazopoulou et al.L 120151) . A lifted representation. 


called abstract ground graph (AGG), plays a central role 
in reasoning with and lear ning of RCM. The c orrectness of 
the algorithm proposed by Maier et ^ ( 2013^ for learning 
RCM from data relies on the soundness and completeness 
of AGG for relational d-separation to reduce the learning 
of an RCM to learning of an AGG . We showed that AGG, 
as defined in ( Maier et'Hl 2013ah . does not correctly ab¬ 
stract all ground graphs. We revised the definition of AGG 
to ensure that it correctly abstracts all ground graphs. We 
further showed that AGG representation is not complete for 
relational (f-separation, that is, there can exist conditional 
independence relations in an RCM that are not entailed by 
AGG. Our examination of the relationship between the lack 
of completeness of AGG for relational ^f-separation and 
faithfulness suggests that weaker notions of completeness, 
namely adjacency faithfulness and orientation faithfulness 
between an RCM and its AGG can be used to learn an 
RCM from data. Work in progress is aimed at: 1) identify¬ 
ing the necessary and sufficient criteria for guaranteeing the 
completeness of AGG for relational t/-separation; 2) estab¬ 
lishing whether the RCD algorithm outputs a maximally- 
oriented RCM even when the completeness of AGG for re¬ 
lational (i-separation does not hold; and 3) devising a struc¬ 
ture learning algorithm that does not rely on a lifted repre¬ 
sentation. 


APPENDIX 


We first prove Lemma[T]in Section [3.2.1l 

Lemma. Given a relational schema S, let P and Q be 
two diff erent re l ationa l paths satisfying the (necessary) cri¬ 
teria of \Maie^ \20l4i) and \Q\ < |P|. Let m and n be 
LLRSP(P, Q) and LLRSP(P, Q), respectively. Then, P 
and Q are intersectable if and only ifm-\-n< \Q\. 


Proof (If part) If m -\- n < \Q\, then we can construct a 
skeleton a such that P|b D Q\b 0 for some b € cr(Pi) 
by adding unique items for Q and for and 

















































complete the skeleton in th e same manner as shown in 
Lemma 3.4.1 dMaier . 20141) . Note that if m + n = \Q\, 
then |P| > IQI + 2 since P ^ Q and a relational path is an 
alternating sequence. This guarantees that there are at least 
two items for 


(Only if part) Let cbe in P|&n(5|b for some arbitrary skele¬ 
ton cr G Eg and b G ct(Pi). Then, there should be two lists 
of items corresponding to P and Q sharing the first m and 
the last n. The condition m + n > |(5| implies Q\b is a 
singleton set. We define 


P — (pi: ■ • ■ : Pmi P|p|—rt + 11 ■ ■ ■ ■:P\P\) 


and 

q = (gi, ■ • ■ 

where {qi} = Q^'^\b and {pe} = P^'^\b for 1 < i < m, 
and p|p|_;+i G P^'*|c for 1 < I < n. We can see that 
Pi — qi = b andp|p| = = c. Moreover, 


Pm = qm= q\Q\-{\Q\-m) = P|P|-(|Q|-m) 

by the definition of LLRSP. If \Q\ < |P|, then m ^ \P\ — 
IQI -f m and mth item for P is repeated at |P| — (IQj — 
m)th, which violates the BBS. Otherwise, it is not the case, 
since |P| = |(5| implies p = q and, hence, P = Q by the 
definition of LLRSP, which contradicts the assumption that 
P and Q are different relational paths. □ 


Proof. Suppose that there exist such a skeleton a and base 
b G ct(P) satisfying {P.X\b JL S'.Z\b \ Q.Y\b)QQj^^. 
Every terminal set for P, Q, and S' given the base must 
not be empty because of the definition of ^f-separation and 
the fact that attribute classes X and Z are connected only 
through Y (i.e., P is a collider). Since every cardinality is 
one, terminal sets must be singletons. Let {i^} = P.X\b, 
{iy} = Q.Y\b, and {i^} = S'.Y\b. Furthermore, since 
and iz must be cZ-connected given iy, GGMa must have 
two edges ix ^ iy iz, which requires ix G Di\i 
and iz G D2\i ■ However, due to BBS and cardinality con¬ 
straints (i.e., one), there exists only one possible structure 
(see Figure|7]l where ix and iz are the cause of iy while sat¬ 
isfying all previously mentioned conditions except {iz} = 
S'.Y\b. In other words, the constraint {iz} = S'.Y\b vi¬ 
olates with the set of the rest of conditions. Hence, there 
exists no such skeleton and base. □ 
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